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METHOD FOR TUNING A DATA PROCESSING SYSTEM FOR SETTING UP A 
HIGH-AVAILABILITY DATA PROCESSING ENVIRONMENT 

Technical field 

The present invention relates generally to data processing 
5 systems and, more particularly, to a method of tuning a data 
processing system with the purpose of defining and setting up 
a fault-tolerant or high-availability data processing 
environment • 

Background art 

10 Data processing systems are pervading almost every field 

of human activity. Fault tolerance of data processing systems 
is thus becoming a major issue not only in very specific 
fields, such as those wherein any possible malfunctioning may 
put human life at risk (e.g. , data processing systems used in 

15 avionics), but also in any field where it is important to 
always guarantee a full or at least a basic system 
functionality, for example not to cause business losses to an 
enterprise . 

A fault-tolerant data processing environment, also 
20 sometimes referred to as a Highly-Available or 
High-Availability (shortly, HA) data processing environment, 
is one in which at least those tasks that are considered more 
critical for the data processing system owner are guaranteed 
even in presence of malf unctionings , e.g., those tasks, 
25 commonly referred to as production tasks, essential for the 
activity or business of a company. 

Data processing systems normally include many components 
or resources, typically arranged in a data communication 
network configuration. Data processing system components may 
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include server machines, such as network servers, file servers 
and production servers, network terminals, for example 
personal computers and/or workstations, mass storage devices, 
printers, application software, infrastructure software, 
5 software drivers, user libraries of software objects, data 
archives and so on. 

A known technique for setting up an HA data processing 
environment in a data processing system calls for implementing 
certain data processing system resources in a redundant 
10 fashion. For example, a so-called cluster configuration is 
defined wherein the resources (e.g., the central processing 
unit, the mass storage devices and the like) associated with a 
given node of the network (e.g., a production server machine) 
are linked to each other so that they can be moved altogether 

15 (mirrored) to a back-up or mirror node so as to ensure 
redundancy for high availability; in the event of a failure of 
the main node, the back-up node is capable of taking over some 
or all of the work of the failed node. 

In many cases, an HA data processing environment needs to 

20 be set up in an already existent data processing system. For 
example, a company relying for its business on an already 
existing data processing system may desire to improve the 
features and reliability thereof by implementing an HA data 
processing environment . 

25 Implementing an HA data processing environment in an 

existing data processing system is a challenging task. A key 
element for implementing a good HA data processing environment 
is the analysis and assessment of the existing data processing 
system, and the tuning thereof in view of the results to be 

30 achieved in terms of reliability and fault tolerance. Highly 
qualified people need to analyse the existing data processing 
system and identify the customer's needs in terms of 
realiability and fault tolerance features; on the basis of the 
information gathered, essentially by personally interviewing 

35 the customer ("i.e., the owner of the data processing system), 
and of their expertise, these experts generate a project 
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report in which the actions deemed necessary to set up an HA 
data processing environment with the desired features are 
listed. The quality of the analysis performed by these experts 
strongly affects the quality of the HA data processing 
5 environment which will be set up on the basis of the analysis 
results . 

This process of definition of the actions needed for 
tuning the existing data processing system for the 
implementation of the HA data processing environment is not 

10 efficient under many respects. Firstly, it totally relies on 
the professional skills of a handful of extremely specialsed 
experts; since adequate training of people in this field of 
technology is very expensive and requires long time, even 
top-level providers of HA solutions for data processing 

15 systems do not have more than a few people capable of 
performing this task. Additionally, as any human activity, it 
is prone to errors; for example, the analysis of the existing 
data processing system may be incomplete, some key elements 
may pass unobserved or be underestimated, some critical issues 

20 may not be discussed with the customer, the customer may not 
be (and normally is not) able to provide all the information 
requested by the HA expert, or the customer's answers may not 
be deeply understood by the HA expert. Furthermore, the 
analysis of the existing data processing system is necessarily 

25 long, and thus costly. 

An HA data processing environment developed on the basis 
of an unsatisfactory analysis of the existing data processing 
system may cause serious problems during the data processing 
system operation, and these problems are often encountered 

30 when HA functionalities are relied upon, e.g. in case of 
system crashes. 

In view of this, it has been an object of the present 
invention to avoid the necessity of completely relying on the 
skills of a human HA expert for the definition of an HA data 

35 processing environment in an existing data processing system. 
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Another object of the present invention has been to 
reduce errors in the process of definition of the HA data 
processing environment for an existing data processing system. 

Still another object of the present invention has been to 
5 make the process of definition of an HA environment for an 
existing data processing system faster, not impairing, even if 
temporarily or partially, the functionality of the existing 
data processing system to be analysed, and less annoying and 
time-consuming for the owner of the data processing system. 

10 Summary of the invention 

Accoding to the present invention, a method of analysing 
and for tuning a data processing system for the purpose of 
setting up an HA data processing environment is provided as 
set forth in appended claim 1. 
15 In brief, the method comprises defining a set of 

high-availability rules, and defining a set of parameters to 
be obtained, indicative of the compliance of the system with 
the high-availability rules. 

The set of parameters is then obtained; this act includes 
20 automatically inspecting the data processing system for 
identifying and collecting data processing system parameters. 

The obtained set of parameters is automatically evaluated 
by applying the high-availability rules, and, responsive to 
said act of evaluating, a set of tuning actions is determined 
25 for the setting up of the highly-available data processing 
environment . 

Brief description of the drawings 

The features and advantages of the present invention will 
be made apparatent by means of the following detailed 
30 description of an embodiment thereof, provided merely by way 
of non-limitative example, which will be made in conjunction 
with the attached drawing sheets, wherein: 
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Figure 1 



Figure 2 



Figure 3 



Figures 
4A, 4B, 4C 
and 4D 
Figure 5 

Figure 6 



Figure 7 



Figures 8 
and 9 



is a schematic view of an exemplary data 
processing system, particularly a computer 
network, in which an HA data processing 
environment is to be implemented; 
schematically shows the main functional blocks 
of a generic computer of the computer network, 
such as a production server computer; 
schematically shows the content of a working 
memory of the production server computer and 
of a client computer of the computer network, 
during the execution of respective parts of a 
client-server software tool implementing a 
method according to an embodiment of the 
present invention; 

are schematic flowcharts of a method according 
to an embodiment of the present invention; 

schematically shows a structure of a data 
processing system parameter database; 
is an exemplary menu page displayed to a user 
of the client computer, in a process of 
definition of the targets of the high 
availability environment to be defined; 
schematically depicts a process of applying an 
exemplary automatic analysis rule in the 
method of Figures 4A, 4B, 4C and 4D; 
schematically depicts two databases of a 
project datahase generated by the method of 
Figures 4A, 4B, 4C and 4D. 



De-bailed description of the preferred embodiment: 



With reference to the drawings, and particularly to Figure 
1, an exemplary data processing system 100 is schematically 
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shown. The data processing system 100 forms for example the 
data processing or computing infrastructure or, in jargon, the 
production environment of an enterprise, a bank, a public 
administration, an Internet service provider or the like. 
5 The data processing system 100 comprises a plurality of 

components 105a, 105b, 105c,..., 105n, for example Personal 
Computers (PCs) , workstations, printers, mass-storage devices 
and the like, arranged in a computer network configuration; 
the computer network, depicted only schematically in Figure 1 

10 and denoted therein by reference numeral 110, is for example a 
Local Area Network (LAN) such as an Ethernet network or an SNA 
(System Network Architecture) network, albeit it is intended 
that the present invention is not limited to any specific 
computer network configuration. 

15 The data processing system 100 includes at least one 

production server machine or production server computer, in 
the shown example represented by the data processing system 
component 105a; in the context of the present description, by 
production server computer 105a there is intended the computer 

20 that, in the data processing system 100, supports the 
application programs critical for the activity or business of 
the , owner of the data processing system. The production server 
computer 105a may also have other functions, e.g. functions of 
network server machine, managing the data traffic over the 

25 network 110, file server, database server and print server. By 
way of descriptive and non-limitative example, in the 
following it will be assumed that the production server 
computer 105a is a machine of the family iSeries Servers 
produced by IBM Corporation, equipped with the OS/400 

30 operating system, being intended that the present invention 
can be generally applied to any data processing system and, 
more specifically, to any computer network, irrespective of 
the server machine type, the operating system, the network 
architecture; examples of different operating systems are 

35 Windows NT, Windows 2000, OS/2, Linux. 
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As schematically shown in Figure 2, a generic computer of 
the data processing system 100 , for example the production 
server computer 105a, comprises several functional units 
connected in parallel to a data communication bus 203, for 
5 example of the PCI type. In particular, a Central Processing 
Unit (CPU) 205, typically comprising a microprocessor, e.g. a 
RISC processor, controls the operation of the production 
server computer 105a, a working memory 207, typically a RAM 
(Random Access Memory) is directly exploited by the CPU 205 

10 for the execution of programs and for temporary storage of 
data, and a Read Only Memory (ROM) 209 stores a basic program 
for the bootstrap of the production server computer 105a. The 
production server computer 105a comprises several peripheral 
units, connected to the bus 203 by means of respective 

15 interfaces. Particularly, peripheral units that allow the 
interaction with a human user are provided, such as a display 
device 211 (for example a CRT, an LCD or a plasma monitor) , a 
keyboard 213 and a pointing device 215 (for example a mouse or 
a touchpad) . The production server computer 105a also includes 

20 peripheral units for local mass-storage of programs (operating 
system, application programs, operating system libraries, user 
libraries) and data, such as one or more magnetic Hard-Disk 
Drivers (HDD), globally indicated as 217, driving magnetic 
hard disks, and a CD-ROM/DVD driver 219, or a CD-ROM/DVD 

25 juke-box, for reading/writing CD-ROMs/DVDs. Other peripheral 
units may be present, such as a floppy-disk driver for 
reading/writing floppy disks, a memory card reader for 
reading/writing memory cards, a magnetic tape mass-storage 
storage unit and the like. The production server computer 105a 

30 is further equipped with a Network Interface Adapter (NIA) 
card 221 for the connection to the computer network 110. 

Any other computer/workstation 105b, . . . , 105n in the data 
processing system 100 has a structure generally similar to 
that depicted in Figure 2, possibly properly scaled depending 

35 on the machine computing performance. 
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Let it be assumed that the owner - of the data processing 
system 100 desires to set up an HA data processing 
environment, so as to render the data processing system 100 
more reliable and fault tolerant; for simplicity of 
5 description, in the following it will be assumed that fault 
tolerance is desired in respect of at least those tasks, 
critical for the activity or business of the data processing 
system owner, that are managed by the production server 
computer 105a. Examples of the tasks that can be managed by 

10 the production server computer 105a are accounting tasks, 
business management tasks, manufacturing process control 
tasks, workflow management tasks, e-commerce tasks, electronic 
messaging tasks, web hosting tasks. One or more of these tasks 
are performed by the production server computer by means of 

15 dedicated application softwares; typically, these application 
softwares have a client-server architecture, with a server 
software component installed and running in the production 
server computer 105a, while a plurality of client software 
components are installed and run in one or more client 

20 computers of the data processing system 100. 

From a practical viewpoint, implementing an HA data 
processing environment in the existing data processing system 
100 means grouping those resources of the data processing 
system 100 that are critical for the activities or business of 

25 the data processing system owner, and mirroring the resource 
groups, including for example an application program, the 
libraries and the data structures it relies upon. For example, 
in order to set up the desired HA data processing environment, 
at least one mirror or back-up production server 105a-bk 

30 (Figure 1) will have to be set up that is capable of hosting a 
copy or mirror resource group, so as to be able to take over 
the role of production server whenever the production server 
computer 105a, for any reason, is not able to perform the 
intended tasks. Depending on the specific cases, the machine 

35 intended to become the back-up production server 105a-bk may 
be chosen among the already existing machines 105b, . . . , 105n 
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of the data processing system 100, if one of the existing 
machines is found to be structurally adequate to perform the 
job of production server, or a new machine with suitable 
features may be added to the data processing system 100, by 
5 connecting it to the data communication network 110. The 
production server computer 105a and the back-up production 
server computer 105a-bk thus form an HA cluster 120 in the 
data processing system 100. 

As discussed in the introduction of the present 

10 description, the definition of the proper HA data processing 
environment for the data processing system 100 involves 
analysing and tuning the existing data processing system 100. 

According to an embodiment of the present invention, a 
method of analysing and tuning the data processing system 100 

15 for the definition and setting up of an HA data processing 
environment is carried on automatically or at least partially 
automatically, by means of an HA definition software tool. In 
particular, according to an embodiment of the present 
invention, the HA definition software tool has a client-server 

20 architecture. A server software component of the HA definition 
software tool, once installed, runs under the control of the 
production server computer 105a in the data processing system 
100 for which an HA data processing environment needs to be 
defined; a client software component of the HA definition 

25 software tool, once installed, runs under the control of a 
client computer in the data processing system 100; the client 
computer can be any one of the personal computers or 
workstations of the data processing system 100 or, preferably, 
a PC 115, e.gr. a portable PC exploited by a user (typically, a 

30 technician of a company providing services in the field of 
information technology) in charge of analysing the existing 
data processing system for defining and setting up the HA data 
processing environment, that is purposedly connected to the 
data communication network 110 and behaves as a client 

35 computer in respect of the production server computer 105a. 
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The client computer 115 has generally the structure shown in 
Figure 2. 

Figure 3 schematically shows a partial content of a 
working memory 105a/207 of the production server computer 
5 105a, and a partial content of a working memory 115/207 of the 
client computer 115, when the server software component and 
the client software compoment of the HA definition software 
tool are running on the production server computer 105a and 
the client computer 115, respectively - 

10 Considering the production server computer 105a, the 

server software component includes a server-side software 
agent 300 adapted to perform an automatic inspection of the 
data processing system 100 and to collect information relevant 
to the definition of the HA data processing environment for 

15 the data processing system 100. In particular, in an 
embodiment of the present invention, the server-side software 
agent 300 performs an automatic inspection of a file system of 
the production server computer 105a, stored on the hard 
disk(s) 105a/217 thereof, identifies and collects production 

20 server computer parameters that are relevant for the 
definition and setting up of the HA data processing 
environment. In - general, the type of parameters that "are 
automatically collected by the server-side software agent 
depends on the data processing system for which the HA data 

25 processing environment needs to be defined, for example on the 
type of production server computer and on its operating 
system; the type of parameters to be automatically collected 
is for example embedded in the code of the server-side 
software agent. 

30 The server-side software agent 300 stores the collected 

production server computer parameters in a production server 
computer parameter database 303; the production server 
computer parameter database 303 is preferably stored in the 
hard disk(s) 105a/217 of the production server computer 105a. 

35 A communication module 305 allows the server-side software 
agent 300 communicating with the client software component 

l 
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over the data communication network 110 (through the NIA card 
105a/221 of the production server computer 105a) . 

In the client computer 115, the client software component 
includes a communication module 307, allowing the client 
5 software component communicating with the server-side software 
agent 300 (through the NIA card 115/221 installed in the 
client computer 115) , an expert-system client-side software 
module 30 9, operating on the basis of a knowledge database 
311, and a Graphical User Interface (GUI ) module 313, allowing 

10 a user to interact with the expert-system client-side software 
module 309 through the display device 115/211 and the input 
devices 115/213 (keyboard) and 115/215 (mouse) of the client 
computer 115. As will be described in greater detail later on, 
the knowledge database 311 includes a database of standard 

15 questions to be answered for defining targets of the HA data 
processing environment, a database of automatic analysis 
rules, exploited by a rule-based automatic analysis engine 310 
of the client-side software module 309 for conducting an 
automatic rule-based analysis of the parameters available for 

20 the definition of the HA data processing environment, a 
database of additional questions that, as a result of the 
automatic rule-based analysis, may require an answer for 
obtaining additional parameters relevant to the definition and 
setting up of the HA data processing environment, and a 

25 database of recommendations/suggestions and of prescriptions 
of corrective or tuning actions to be performed on the data 
processing system 100 for the definition and setting up of the 
HA data processing environment. In the context of the present 
description, by additional questions there is intended one or 

30 more additional questions directed to obtaining, from the user 
and/or the owner of the data processing system 100, additional 
parameters relevant for the definition of the desired HA data 
processing environment, such additional parameters not being 
automatically retrievable from the data processing system by 

35 automatic inspection thereof, and such additional questions 
arising once from the analysis of the data processing system 
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parameters once the targets of the HA data processing 
environment have been defined. Also, in the context of the 
present description, by tuning- actions there is intended one 
or more corrective actions to which the existing data 
5 processing system 100 needs to be subjected in order to 
prepare and adapt it to the setting up of the desired HA data 
processing environment . 

In operation, the expert-system client-side software 
module 309 generates a project database 315. The knowledge 

10 database 311 and the project database 315 are for example 
stored on the hard disk 115/217 of the client computer 115. A 
database connectivity module 321, for example based on JAVA, 
allows the expert-system client-side software module 309 
interacting with the knowledge database 311 and the project 

15 database 315. 

A knowledge database managament software module 319, 
interacting with the GUI module 313 and the database 
connectivity module 321, allows the user to manage (e.g., 
update) the knowledge database 311. Exploiting the knowledge 

20 database managament software module 319, the user can update 
the knowledge database 311, e.g. for modifying, adding, 
deleting standard questions, rules for - the automatic 
rule-based analysis, and additional questions or prescriptions 
of corrective actions. 

25 By way of example, the communication modules 305 and 307 

allows communication between the production server computer 
105a and the client computer 115 on the basis of the TCP/IP 
communication protocol, and the database connectivity module 
319 is a Java database connectivity module. 

30 Figures 4A, 4B, 4C and 4D provide, in terms of schematic 

flowcharts, a general overview of a method according to an 
embodiment of the present invention. 

Referring to Figure 4A, as a first step the client 
software component, installed and running on the client 

35 computer 115 installs the server-side software agent on the 
production server computer 105a (block 400) . A server-side 
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software agent installation package is uploaded and stored 
into hard disk 105a/217 of the production server computer 
105a, and a remote installation procedure is launched by the 
client software component so as to cause the installation of 
5 the server-side software agent on the production server 
computer 105a. Alternatively, the server-side software agent 
can be installed on the production server computer 105a by 
inserting into the production server computer CD-ROM/DVD 
driver 105a/219 a CD-ROM/DVD with the server-side software 

10 agent installation package stored thereon; once the CD-ROM/DVD 
is inserted into the driver, a manual or automatic 
installation procedure is launched, leading to the 
installation of the server-side software agent; in this case, 
the actions schematised by block 400 are not performed by the 

15 client software component. 

After the server-side software agent has been installed on 
the production server computer 105a, the client software 
component invokes the server-side software agent (block 403) . 
The invocation of the server-side software agent by the client 

20 software component causes a production server inspection 
routine to be launched on the production server computer 105a. 
In particular, the server-side software agent 300 is launched 
(block 405) , and an automatic exploration and inspection of 
the production server computer 105a is performed, particularly 

25 of the production server computer file system 317 stored on 
the hard disk(s) 105a/217 r so as to identify and collect all 
production server computer parameters relevant to the 
definition of the HA data processing environment (block 407) . 
Preferably, the automatic inspection of the production server 

30 computer is carried on by submitting on the production server 
computer 105a batch processes that do not interfere with the 
normal productive tasks that the production server computer 
105a is intended to perform. For example, in the case the 
production server computer 105a is a machine of the family 

35 iSeries Servers by IBM Corporation, the automatic inspection 
routine is carried on by executing batch processes coded in 
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CLP and RPG ILE language, and system Application Program 
Interfaces (APIs) are exploited to reduce the impact of the 
automatic inspection routine on the normal productive tasks. 

The production server computer parameters searched for and 
5 collected by the production server computer inspection routine 
407 generally vary depending in particular on the type of 
machine used as production server and on the type of operating 
system installed thereon. In the non-limitative example of a 
production server computer of the family iSeries Servers by 

10 IBM Corporation, the production server computer automatic 
inspection routine 407 allows automatically collecting the 
following classes of production server computer parameters: 
class a) general information on the production server computer 
105a, including: 

15 al) the name of the production server computer 105a; 

a2) the type of machine used as production server computer 
105a; 

a3) the type and version of the Operating System (OS) 
installed on the production server computer 105a; 
20 a4) the mass-storage space available on the production server 
computer 105a, particularly on the hard disk(s) 105a/217 
thereof, and the exploited storage space; 

a5) the presence or absence of a magnetic tape mass-storage 
unit; 

25 a6) system values defining the behaviour of the production 
server computer 105a, e.g. system values defining the system 
auditing functionalities; 

a7) network attributes of the production server computer 105a 
in the computer network 110, e.g. the network identifier, the 
30 local location name, the name of the network server; 

class b) information on the structure of the file system of 
the production server computer, e.g., the list of folders and 
sub-folders in the file system; 

class c) a list of user libraries of software objects and data 
35 available on the production server computer, and a list of the 
software objects contained in each user library; 
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class d) a list of user programs, resident on the production 
server computer, that exploit OS commands, particularly OS 
commands that are considered relevant and, possibly, critical 
for the definition of the HA environment. 
5 The server-side software agent 300 then generates (block 

409) the server computer parameter database 303, in which the 
production server computer parameters, collected during the 
automatic production server computer inspection routine 407, 
are stored. It is intended that the server computer parameter 

10 database 303 may be generated during the execution of the 
automatic inspection routine 407, as the production server 
computer parameters are retrieved. 

The server computer parameter database 303 is preferably 
structured in such a way that logically coherent production 

15 server computer parameters are grouped together; referring to 
the above example of possible production server computer 
parameters collected by the production server computer 
inspection routine 407, the general information on the 
production server computer, the information on the file system 

20 structure, the list of user object libraries and the list of 
objects included in the libraries, and the list of user 
programs exploiting OS commands are stored in different files, 
particularly five files 500, 503, 505, 507 and 509, as 
schematically shown in Figure 5. 

25 In greater detail, the file 500, intended to store 

parameters of the class "general information on the production 
server computer" (class a in the above list) , is structured as 
a record having a plurality of fields (e.g., the fields 
PSNAME, OSVER, ASP, ESP, TAPE, AUDCT, AUDLV, NT ID shown in the 

30 drawing) , each field intended to contain one or more 
respective parameters such as the name identifying the 
production server computer (field PSNAME) , the installed OS 
version (field OSVER) , the available storage space on the hard 
disk of the production server computer 105a (field ASP), the 

35 exploited storage space, e.g. in percentage of the available 
storage space (field ESP) , the presence of a tape storage unit 
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(field TAPE) / the system values defining the production server 
behaviour, such as a system value defining the activation of 
auditing functionalities (field - AUDCT) and system values or 
settings defining the level of auditing (field AUDLV) , the 
5 network attributes of the production server computer, such as 
the network identifier (field NTID) , and the like. 

The file 503, intended to store parameters of the class 
^information on the file system structure of the production 
server computer" (class b listed above) , includes a plurality 

10 of records, each one corresponding to a respective 
folder/sub-folder (such as the folders ROOT, A, Al, B shown in 
the drawing) found in the file system 317 of the production 
server computer 105a* Each record has a plurality of fields 
{e.g. the fields FOLDNAME, FOLDPATH, FOLDDESC shown in the 

15 drawing), for storing the name of the f older/sub-folder (field 
FOLDNAME) , the folder/sub-folder path in the file system 
(field FOLDPATH) , a description of the folder/sub-folder 
(field FOLDDESC) and so on. 

The file 505, intended to store the list of user object 

20 and data libraries found in the production server computer 
105a, includes a plurality of records, each one corresponding 
to a respective library (e.g. the libraries LIBa and LIBb 
shown in the drawing) ; each record has a plurality of fields 
(e.g. the fields LIBNAME, LIBDSC, LIBSIZE, OBJ#, FILE#, 

25 DTAARA, DTAQUE, OUTQUE shown in the drawing) , for storing the 
library name (field LIBNAME) , a description of the library 
(field LIBDSC), the library size (field LIBSIZE) , the number 
of software objects present in the library (field OBJ#) , the 
number of physical files present in the library ( field FILE#) , 

30 the number of data areas (field DTAARA), of data queues (field 
DTAQUE) and of print queues (field OUTQUE) present in the 
library and so on. 

The file 507, intended to store the list of software 
objects found in the production server computer 105a, includes 

35 a plurality of records, each one corresponding to a respective 
software object (e.g. the software objects OBJa and OBJb shown 
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in the drawing); each record has a plurality of fields (e.g. 
the fields OBJNAME, OBJLIB, OBJTYPE, OBJDSC, OBJSIZE shown in 
the drawing) for storing the name of the respective object 
(field OBJNAME) , the user library containing the object (field 
5 OBJLIB), the type of the object (field OBJTYPE), a description 
of the object (field OBJDSC), the object size (field OBJSIZE) 
and so on* 

Finally, the file 509, intended to store the list of user 
programs found in the production server computer 105a and 

10 exploiting OS commands, includes a plurality of records, each 
one corresponding to a respective user program (e.g. the user 
programs PRGa and PRGb shown in the drawing) ; each record has 
a plurality of fields (e.g. the fields PROGNAME and OSCMD 
shown in the drawing) for storing the user program name (field 

15 PROGNAME, the exploited OS command (field OSCMD) and so on. 

The client software component periodically checks the 
execution progress status of the production server automatic 
inspection and server parameter database generation routines 
407 and 409 (block 413) . When the production server automatic 

20 inspection and server parameter database generation routines 
407 and 409 are completed, the server-side software agent 
notifies the client software component (block 415) . The client 
software component then activates a file transfer routine to 
download the server parameter database 303 from the production 

25 server computer 105a (blocks 417 and 419) . A copy 421 of the 
server parameter database 303 is thus created locally to the 
client computer 115, and it is stored in the hard disk 
115/217 of the client computer 115. 

After the local copy 421 of the server parameter database 

30 has been created, the client software component calls a 
routine for the definition of the HA data processing 
environment (block 423) ; the HA data processing environment 
definition routine 423 will be described in detail later on, 
in conjunction with Figure 4B. 

35 At the exit of the HA data processing environment 

definition routine 423, the client software component checks 
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whether the HA data processing environment definition routine 
423 has requested a new inspection of the production server 
computer 105a to be carried on (block 425) . In the negative 
case (exit branch N of block 425) , the process terminates, 
5 otherwise (exit branch Y of block 425) the process flow jumps 
back to block 403, ' the server-side software agent 300 is 
invoked again, and all the actions previously performed are 
repeated. 

Referring now to Figure 4B, a flowchart providing a 
10 general overview of the HA environment definition routine 423 

is shown. The routine 423 works as an expert system, on the 

basis of the knowledge database 311; the knowledge database 

311 forms the base of knowledge exploited by the HA data 

processing environment definition routine 423. 
15 In particular, in an embodiment of the present invention, 

the knowledge database 311 is composed of four different 

knowledge databases 443, 445, 447 and 449. 

The knowledge database 443 contains a list of default or 

standard questions that need to be answered, by the user 
20 and/or the owner of the data processing system 100, in order 

to get parameters defining the targets of the HA data 
• processing environment to be set up for the data processing 

system 100. 

The knowledge database ^445 is a database of pre-defined 
25 rules that are exploited by the automatic analysis engine 310, 
invoked by the HA data processing environment definition 
routine 423, for automatically analysing the available 
parameters and tuning the data processing system 100, with the 
purpose of defining the HA data processing environment and 
30 preparing the data processing system 100 for the 
implementation of the HA data processing environment. 

The knowledge database 447 contains a list of additional 
questions that, depending on the specific situation, i.e. on 
the result of the automatic analysis of the available 
35 parameters, the HA data processing environment definition 
routine 423 may need to be answered by the user and/or the 
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owner of the data processing system 100, for getting 
additional parameters necessary to define the HA data 
processing environment. 

The knowledge database 449 contains a list of 
5 recommendations/suggestions for the definition and setting up 
of the HA data processing environment. Additionally, the 
knowledge database 44 9 contains a list of prescriptions of 
corrective actions that, depending on the specific situation, 
it may be necessary to implement in the data processing system 
10 100 for preparing it to the setting up the desired HA data 
processing environment . 

In an embodiment of the present invention, the knowledge 
databases 443, 445, 447 and 449 are structured as extensible 
Markup Language (XML) files. 
15 The HA data processing environment definition routine 423, 

once launched, generates the project database 315 <. 

In an embodiment of the present invention, the project 
database 315 is composed of three project databases 453, 455 
and 457. 

20 The project database 453 contains the production server 

computer parameters, collected by the server-side software 
agent 300 and downloaded by the client software component from 
the production server computer 105a. The project database 453 
substantially coincide with the local copy 421 of the server 

25 parameter database 303, that is created locally to the client 
computer 115; in particular, referring to the example provided 
above, the project database 453 includes five files, 
corresponding to the five files 500, 503, 505, 507 and 509 of 
the server parameter database 300 shown in Figure 5. In 

30 addition to the record fields present in the corresponding 
files of the server parameter database 303, some files of the 
project database 453 include an additional record field, for 
the selection of the corresponding resource for mirroring 
purposes (as will be better described in the following) ; in 

35 particular, as shown in phantom in Figure 5, an additional 
record field SLTFLD in each record of the file of the project 
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database 453 corresponding to the file 503 of the server 
parameter database 303, and an additional record field SLTLIB 
in each record of the file of the project database * 453 
corresponding to the file 505 of the server parameter database 
5 303, allow storing an indication that the respective 
folder/sub-folder or, respectively, user library has been 
selected for mirroring. 

The project database 455 contains a list of the questions 
(the standard questions listed in the knowledge database 443 

10 and, possibly, one or more of the additional questions listed 
in the knowledge database 447) issued during the execution of 
the HA data processing environment definition routine 423, 
together with the respective parameter/parameters derived from 
the answers obtained. 

15 The project database 457 contains a list of 

recommendations/suggestions and a list of tuning or corrective 
action prescriptions generated by the HA data processing 
environment definition routine 423 . 

Preferably, the project database 315 is created or saved 

20 in a project database repository (not shown) , containing all 
the project databases of all the HA data processing 
environment definition projects already generated. In this 
case, before starting the process for analysing the data 
processing system 100 and defining the desired HA data 

25 processing environment, the data processing system 100 latter 
is defined by the user and a respective project database 315 
is created within the repository. 

In an embodiment of the present invention, the project 
databases 453, 455 and 457 are DB2, Oracle, or Informix 

30 relational databases, accessible through Structured Query 
Language (SQL) commands. Additionally , the project databases 
are accessible through the database connectivity module 317. 

When the HA data processing environment definition routine 
423 is invoked, an HA targets definition procedure is launched 

35 (block 459) ; during the execution of the HA targets definition 
procedure 459, the knowledge database 443 is accessed and the 
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list of standard questions is retrieved. Through the GUI 313, 
a menu is displayed on the display device 115/211 of the 
client computer 115* Through the menu, the user is guided in 
the process of inputting the parameters necessary to define 
5 the HA data processing environment targets, possibly obtaining 
the necessary answers from the owner of the data processing 
system 100. Figure 6 schematically shows an exemplary menu 
page 600; it is intended that the menu may include more than 
one menu page. The menu includes a plurality of text boxes 

10 603,..., 609 and associated input boxes or frames 611,..., 
623, such as check boxes and selection lists, so as to enable 
the user entering the information for answering the standard 
questions needed to define the HA data processing environment 
targets, using the input devices 115/213 and 115/215 of the 

15 client computer 115. Typical standard questions, which appear 
in descriptive form in the menu page as text boxes, may 
include questions on the number of production servers involved 
in the HA data processing environment to be set up, the 
functions that the back-up production server (s) 105a-bk 

20 is (are) intended to perform (merely back-up functions or 
additional functions), the computer network infrastructure 
(i.e., the' type of data communication network 110), the 
network protocol (s) used by the production clients for 
communicating with the production server (s), and so on. In 

25 general, the standard questions are directed to get 
information relevant for the definition of the HA data 
processing environment, and that cannot be automatically 
retrieved by the automatic inspection routine 407 from the 
production server computer 105a. 

30 The user, interacting if and when necessary with the owner 

of the data processing system 100, gets the required pieces of 
information and inputs them through the menu page 600. 

The list of standard questions and the associated answers 
are stored in the project database 455. In particular, as 

35 schematically depicted in Figure 8, the project database 455 
is structured as a table having an entry for each standard 
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question got from the knowledge database 443; any entry of the 
table includes a first field (the field QUESTION in the 
drawing) in which a description of one of the standard 
questions get from the knowledge database 443 is stored (for 
5 example, a character string) , and a second field (the field 
ANSWER) in which the associated parameter inputted by the user 
as an answer to the question is stored. 

In addition to the standard questions retrieved from the 
knowledge database 443 , the HA targets definition procedure 

10 459 guides the user in a process of selection of the resources 
of the data processing system 100 intended to be rendered 
highly available. For example, as shown in Figure 6, in the 
menu page(s) 600 text boxes 625,..., 629 and input boxes 631, 
633 guide the user in a process of selection of the folder or 

15 folders/sub-folders, in the file system 317 of the production 
server computer 105a, and of the user object and data 
libraries resident in the production server computer 105a, 
that needs to be mirrored in order to implement the desired HA 
data processing environment. To this purpose, the project 

20 database 453 is accessed, and the data concerning the file 
system structure (folders/sub-folders - file FILESYS) and the 
libraries resident on the production server computer 105a 
(file LIB) are retrieved; the list of folders/sub-folders and 
the list of libraries are displayed on the menu page, so that 

25 the user, selecting associated check boxes, can select the 
items that it is desired to mirror. Within the project 
database 453, the items selected during this process are 
marked as selected resources to be mirrored (record fields 
SLTFLD and SLTLIB in Figure 5) . Clearly, other items in 

30 addition to the folders/sub-folders and the user object and 
data libraries can be submitted to a selection process by the 
user, for example the selection may go down to the level of 
the single objects in the libraries. 

Once the HA targets have been defined, the HA targets 

35 definition procedure 459 ends. Next, an automatic analysis 
procedure 461 of the available data is launched. The automatic 
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analysis procedure 461 exploits the pre-defined rules in the 
knowledge database 445, which are applied to the data present 
in the project database 453 (containing the production server 
computer parameters automatically collected by the 
5 software-side software agent 300), in the project database 455 
and in the project database 4 57; it is observed that at the 
time the automatic analysis procedure 461 is launched the 
first time, the project database 455 only contains the list of 
standard questions and associated answer parameters obtained 
10 during the execution of the HA targets definition procedure 
459, and the project database 457 does not contain any 
recommendation/suggestion or prescription of corrective 
actions) . 

In particular, in an embodiment of the present invention 

15 the knowledge database 445 is an XML file containing all the 
pre-defined automatic analysis rules. A generic rule, when 
applied, can be verified or not verified, and produces as an 
output a response; the response may be a positive response, if 
i the rule is verified, or a negative response, if the rule is 

20 not verified- Each rule includes one or more conditions to be 
verified, and the rule is considered verified, and produces a 
positive response, if and only if all the conditions it 
includes are verified; if even only one of the rule conditions 
is not verified, the rule is considered not verified and a 

25 negative response is produced. A generic rule condition 
consists of an elaborated result, a comparison operator and a 
reference parameter; the elaborated result is a value obtained 
by accessing either one of the project databases 453, 455 or 
457, for example by applying an SQL command; the value 

30 retrieved from the specified project database is compared, 
using the specified comparison operator, to the specified 
reference parameter. If the comparison operator has a positive 
outcome, the respective rule condition is considered verified. 
Depending on the specific rule, the positive and negative 

35 responses of a generic rule are links to specific entries in a 
rule response database; the rule response database comprises 
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the knowledge database 447, containing the list of additional 
questions, and the knowledge database 449, containing the list 
of recommendations/suggestions and prescriptions of corrective 
actions . 

5 The following XML code fragment represents an exemplary 

rule definition: 



<rule 

responseCatego ry= "L IB " 
10 posit iveResponseKey="" 

negati veRespon s eKey= "keyO "> 
<condition 

fx el d- "DTAARA " 
tables="lib" 

15 where="DTAARA>0 and SLTLIB>"" 

regroupMethod=" * " 
comparisons" >" 
re f eren ceValue="0" 

/> 

20 <condition 

field="OSVER" 

tables^" sys" 

where="OSVER= ' V5R1M0 ' " 

regroupMe th od= " * " 
25 comparison^" >" 

referenceValue^"0" 

/> 

</rule> 

30 In this code fragment, keyO is the link to an entry in the 

rule response database that is accessed in case the 
application of the rule provides a negative response 
(negativeResponseKey="keyO") ; no entry in the rule response 
database is specified for a positive response 

35 (positiveResponseKey^"") . The exemplary rule shown includes 
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two conditions to be verified; in particular, in this 
examplary rule the two conditions require accessing the 
project database 453, For each condition, the file or table 
{lib, sys) in the project database 453 to be accessed and the 
5 respective record field (DTAARA, SLTLIB, OSVER) to be 
inspected are defined. Each rule condition contains one or 
more SQL commands or conditions (where="DTAARA>0 and 
SLTLIB>"", where="OSVER='V5R!M0 r ") , allowing to retrieve the 
records satisfying the SQL condition; in the shown example, 
10 all records in which the fields DTAARA and SLTLIB are not void 
are retrieved from the file LIB in the project database 453, 
and all the records in which the field OSVER contains V5R1M0 
are retrieved from the file SYS of the project database 453. 
An aggregation method for the retrieved records is also 
15 specified (the exemplary aggregation operator 

regroupM&thod="*" allows counting the number of records 
retrieved by applying the specified SQL condition) . In both of 
the exemplary rule conditions, the comparison operator is a 
simple majority operator ( comparlson=">") , and the reference 
20 value is zero ( reference Value="0") . 

Referring to Figure 7, when this exemplary rule is 
applied, the file LIB in the project database 453 is accessed, 
and it is ascertained whether, among the user libraries 
resident on the production server computer 105a, there are 
25 libraries containing data areas (field DTAARA not void), and 
whether such a library or libraries have been selected for 
mirroring during the execution of the procedure 4 59; as a 
second condition, the file SYS in the project database 453 is 
accessed, and it ascertained whether the operating system 
30 version (field OSVER) installed in the production server 
computer 105a coincides with the OS version V5R1M0. Since the 
second condition is not verified (the installed OS version 
differs from the specified version) , the rule has a negative 
response, the entry keyO is accessed in the rule response 
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database (in this example the knowledge database 44 9) , and the 
associated prescription of corrective action: 

Current OS version does not support journalisation of Data 
Area/Data Queue objects - Upgrade OS 
5 is added to the project database 457 (Figure 9), intended to 
contain the list of recommendations/suggestions and 
prescription of corrective actions. 

The flowchart of Figure 4C provides a general overview of 
the rule-based automatic analysis procedure 4 61. The automatic 

10 analysis engine 310 is launched. As a first step, the 
knowledge database 445 is accessed, all the rules are 
retrieved and they are put into a rule stack 4103 (block 483) . 
Then, it is checked whether there are rules in the rule stack 
4103 (block 485) to be applied. If there are rules in the 

15 stack (block 485, exit branch Y) , the first rule in the stack 
is taken from the rule stack 4103, for example on a first-in 
first-out basis, and the rule is applied (block 487). The 
validity of the rule context is fist ascertained (block 489) : 
in particular, the correctness of the rule sintax, and the 

20 existence of the rule positive and negative responses are 
verified. If it is ascertained that the rule context is not 
valid (block 489, exit branch N)', the rule is declared as 
invalid (block 491), no response is produced, and the next 
rule is taken from the rule stack 4103 (block 487) . If instead 

25 the rule context is ascertained to be valid (block 489, exit 
branch Y) , it is ascertained whether the current rule includes 
conditions still to be verified (block 493) . In the negative 
case (block 493, exit branch N) , a positive response is 
declared for the rule (block 495) , the proper response is 

30 taken from the response database (accessing the proper 
knowledge database among the knowledge databases 4 47 and 4 4 9) 
and the next rule is taken from the rule stack 4103 (connector 
B) . Otherwise (block 493, exit branch Y) , it is ascertained 
whether the condition context, i.e. the formal correctness of 

35 the rule conditions, is valid (block 497) . If the condition 
context is found invalid (block 497, exit branch N) , the rule 
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is declared invalid (block 491), no response is produced, and 
the next rule is taken from the rule stack 4103. If the 
condition context is found valid (block 497, exit branch Y), 
the rule condition is applied and it is ascertained if the 
5 condition is verified (block 499) . In the affirmative case 
(block 499, exit branch Y) , the next rule condition is 
assessed, otherwise (block 499, exit branch N) the rule is 
declared to have a negative response (block 4101) , the proper 
response is taken from the respknse database and the next rule 

10 is taken from the rule stack 4103 (connector B) . 

When no more rules are left in the stack (block 485, exit 
branch N) , the automatic analysis procedure 461 terminates. 

It can be appreciated that the application of the 
pre-defined automatic analysis rules may cause the knowledge 

15 database 447 to be accessed one or more times, and one or more 
additional questions to be selected from the additional 
question list of the knowledge database 44 7 and be added to 
the project database 455; similarly, the project database 449 
may be accessed one or more times, and one or more 

20 recommendations/suggestions and/or prescriptions of corrective 
actions be selected from the lists of the knowledge database 
449 and be added to the project database 457. 

It is pointed out that some analysis steps of the 
automatic analysis procedure, instead of being based on the 

25 predefined rules in the knowledge database 445, may be 
directly embedded in the code of the automatic analysis 
procedure 4 61. 

Once the automatic analysis procedure 461 has applied all 
the pre-defined rules present in the knowledge database 445, 
30 the project database 455 is accessed to check wether, during 
the execution of the automatic analysis procedure 4 61, 
additional questions have been added to the existing list of 
questions and associated answers (blocks 463 and 465) . 

In the affirmative case, a procedure 467 is activated for 
35 having the user get the information answering the additional 
questions. Similarly to the HA targets definition procedure 



FR920020082/EP1 



28 



459, the project database 455 is accessed and the additional 
Questions are retrieved; one or more menu pages are displayed, 
through the GUI 313, on the display device 115/211 of the 
client computer 115, with text boxes providing the additional 
5 questions in descriptive form, and the user is guided in the 
process of entering, through input boxes in the menu page or 
pages, the required additional information. 

The answer parameters obtained during the execution of the 
procedure 4 67 are stored in the project database 455; in 

10 particular, as schematically depicted in Figure 8 and 
similarly to the standard questions, for each additional 
question selected from the knowledge database 44 7 during the 
execution of the automatic analysis procedure 4 61, a new entry 
is created in the table of the project database 455; such an 

15 entry contains a first field, with a description of the 
additional question, and a second field with the corresponding 
answer obtained during the execution of the procedure 4 67. 

Once the additional questions have been answered, the 
procedure 4 67 ends, and the flow jumps back to the automatic 

20 analysis procedure 461 (connector A) . The automatic analysis 
procedure 4 61 is launched again, and a new automatic analysis 
is performed. In this way, those rules that could not be 
applied in the previous run of the automatic analysis 
procedure 4 61, because they included conditions that 

25 necessitated parameters obtained by answering the additional 
questions, can now be applied. It is observed that new 
additional questions can be added to the project database 455 
during the new run of the automatic analysis procedure 4 61, so 
that the process can be reiterated a number of times. 

30 If instead no additional questions are found in the 

project database 455, the project database 457 is accessed to 
ascertain whether one or more prescriptions of corrective 
actions have been generated during the execution of the 
automatic analysis procedure 461 (blocks 469 and 471) . 

35 In the affirmative case (block 471, exit branch Y) , a 

procedure is launched requesting the user to implement the 
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necessary corrective actions, retrieved from the project 
database 457 (block 473) . Through the GUI 313, one or more 
menu pages are displayed on the display device 115/211 of the 
client computer 115, The menu page or pages include text 
5 windows, wherein a human-readable description of the required 
corrective actions needing to be implemented on the data 
processing system 100 is displayed, and associated input 
boxes, which the user can activate to declare that the 
required corrective actions have been or will be implemented. 

10 The user is left free to either implement none, some or all of 
the required corrective actions at this time, before the HA 
data processing environment definition procedure goes on, or 
to implement them at a later time. 

In an embodiment of the present invention, one or more of 

15 the prescribed corrective actions, once authorised by the 
user, are automatically implemented on the data processing 
system 100, particularly on the production server computer 
105a. In the following, two possible corrective actions that 
can be automatically implemented by the HA definition software 

20 tool will be described, assuming again by way of example that 
the production server computer 105a is a machine of the family 
iSeries Servers by IBM Corporation. 

As a first example, let the following rule definition (in 
XML language) be considered: 



<rule 

responseCategory="LIB" 
pos±tiveResponseKey="outqnoaudlvl" 
nega tiveResponseKey=""> 
30 <condition 

field="OUTQ" 

tables="lib" 

where="OUTQUE>0 and SLTLIB> 
regroupMethod="*" 
3 5 compa ri s on="> " 
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refer en ceVa lue="0" 

/> 

<condition 

field=" AUDLV" 
5 tables="sys" 

where=" AUDLV NOT LIKE *%*SPLFDTA%" 
regroupMethod^"*" 
compa ri son="> " 
refer en ceVa lue="0" 

10 /> 

</rule> 



The rule has a positive result if, among the libraries 
selected for mirroring, there is one or more libraries 

15 containing print queues (field OUTQUE in the file LIB not 
equal to 0) and at the same time the settings of the system 
value specifying the level of auditing functionalities of the 
production server computer 105a (field AUDLV in the file SYS) 
does not include the setting SPLFDTA (a setting enabling 

20 auditing of spooled file functions) . In case the application 
of this rule provides a positive result, the following 
descriptive prescription of corrective action is taken from 
the knowledge database 449 and is added to the project 
database 457: 

25 Print queue objects found in libraries selected for mirroring 
- system value AUDLV needs to be updated to include setting 
SPLFDTA. 

If the user (preferably after having informed and obtained 
autohorisation from the owner of the data processing system 

30 100) grants the authorisation for automatically implementing 
the proposed corrective action, the following system command 
(stored in the knowledge database 449 together with the 
corresponding descriptive description of corrective action) is 
issued by the HA definition software tool to the production 

35 server computer 105a: 
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CHGSYSVAL SYSVAL (AUDLV) VALUE (** CREATE * DELETE *SAVRST 
*SPLFDTA) 

where CREATE, DELETE and SAVRST represent exemplary current 
settings of the system value AUDLV in the production server 
5 computer 105a, before the corrective action is implemented. As 
a consequence of the application of this system command to the 
production server computer 105a, the settings of the system 
value AUDLV are updated to include the desired setting 
SPLFDTA. 

10 As a second example, let the following rule definition (in 

XML language) be considered: 



<rule 

responseCategory="LIB" 
15 posi tiveResponseKey : ="oldobj " 

negat±veResponseKey=""> 
<condit±on 

field="OBJDSC" 
tables="obj" 

20 where="OBJDSC LIKE *%old%' 

regroupMethod=" * " 
compa rison= "> " 
refer en ceVa lue="0" 

/> 

25 </rule> 



This second exemplary rule provides a positive response if 
there are objects containing in the field OBJDSC the character 
string old, possibly meaning that such objects are obsolete 
30 and can be cancelled (preferably, the rule will include 
conditions, not explicitly shown, directed to limiting the 
check to the objects belonging to libraries selected for 
mirroring) • In case the application of this rule provides a 
positive result, the following descriptive prescription of 
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corrective action is taken from the knowledge database 449 and 
is added to the project database 4 57: 

Objects described as "old" found in the libraries selected for 
mirroring - if obsolete f please delete these objects. 
5 If the user grants the authorisation for automatically 

implementing the proposed corrective action, the following 
system command or commands are issued by the HA definition 
software tool to the production server computer 105a: 
DLTF FILE (filepath/ filename object) 

10 where filepath/ file object means the path and file name of the 
object file to be deleted. 

In other words, and in general terms, in the knowledge 
database 449 containing the list of recommendations and the 
list of prescriptions of corrective actions, one or more 

15 prescriptions of corrective actions are accompanied by 
commands that, if the automatic implementation of the 
corresponding corrective actions is authorised by the user, 
can be automatically issued by the HA definition software tool 
to the data processing system 100, in order to automatically 

20 implement the corresponding corrective actions. The commands 
stored in the knowledge database may be command templates, 
which are completed by parameters retrieved from the project 
database 451, particularly from the project database 453 (as 
in the case of the two examplary rules provided before) . In 

25 general, only some corrective actions are of such a nature 
that automatic irtiplementantion thereof is possible; the 
implementation of other corrective actions, such as increasing 
the hard disk(s) storage space of the productionm server 
computer 1.05a or upgrading the OS version, is left to the 

30 user. 

Figure 4D is a schematic flowchart providing an overview 
of the procedure 473 requesting the user to implement the 
corrective actions, in an embodiment of the present invention. 
The presence of prescription of corrective actions in the 
35 project database 457 is first checked (block 4120) . In the 
negative case (block 4120, exit branch N) the procedure ends, 
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otherwise (exit branch Y) the first corrective action 
prescription is retrieved from the project database, for 
example on a first-in, first-out basis (block 4123) . Then, it 
is ascertained whether the corrective action can be 
5 automatically implemented (block 4125) ; this can for example 
rely on the presence in the project database 457 of commands 
associated with the proposed corrective action. In the 
affirmative case (block 4125, exit branch Y) , authorisation is 
requested to the user for automatically implementing the 

10 corrective action (block 4127); for example, through the GUI 
313, a menu page similar to that shown in Figure 6 is 
displayed to the user on the display 115/211 of the client 
computer 115; the prescription of corrective action is 
displayed in descriptive form, possibly together with the 

15 command or commands proposed for automatically implementing 
the corrective action; through a check box, the user can 
select whether or not to grant the authorisation for 
automatically implementing the corrective action. Preferably, 
before deciding whether to implement a corrective action, the 

20 user shall get the authorisation by the owner of the data 
processing system 100. If the authorisation is granted (block 
4129, exit branch Y) , the HA definition software tool 
automatically implement the corrective action on the data 
processing system, for example by issuing the predefined 

25 system command or commands described in the foregoing to the 
production server computer 105a (block 4131) ; it is observed 
that this may imply that the client computer 115 connects to 
the production server computer 105a (or to a network server 
computer) with system manager privileges, and issues remote 

30 system commands. The corrective action is then marked as 
implemented in the project database 457 (block 4133) . If on 
the contrary the authorisation is not granted (block 4129, 
exit branch N) , no action is undertaken, and the flow loops 
back to block 4120. In case the corrective action is one that 

35 cannot be automatically implemented (for example, a corrective 
action involving updating the version of the OS installed on 
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the production server computer 105a) , the user is requested to 
implement the corrective action (block 4135) ; for example, 
through the GUI 313 , a menu page is displayed in which a 
description of the proposed corrective action is shown. The 
5 user is left free to take the necessary steps for implementing 
the corrective action at this time or at a later time; in case 
the user decides to implement the corrective action at this 
time, the user selects the corrtective action through a check 
box. If the corrective action is detected as implemented at 

10 this time (block 4137, exit branch Y) , the corrective action 
is marked as implemented in the project database 457 (block 
4133) . The flow loops back to block 4120, and all the above 
described actions are repeated until all the prescriptions of 
corrective actions in the project database 457 are checked. 

15 Coming back to Figure 4B, in the case at least one of the 

prescribed corrective actions is implemented during the 
procedure 473, either by the user or automatically by the HA 
definition software tool, on the data processing system 100 at 
this time (block 475, exit branch N) , a new inspection of the 

20 production server computer 105a is normally needed. The HA 
environment definition routine 423 terminates returning an 
output code corresponding to a request of a new inspection ■ of 
the production server computer 105a (block 477) . 

It is pointed out that, in the practice, it may happen 

25 that some corrective actions, albeit implemented at this time 
on the data processing system 100, either by the user or 
automatically, are of such a nature that a new inspection of 
the data processing system 100 is not necessary. In this case, 
no request is made of a . new production server computer 

30 inspection. 

In case all the required corrective actions are postponed 
(block 475, exit branch Y) , or if no prescriptions of 
corrective actions are found in the project database 457, a 
procedure is launched for generating a project report (block 
35 479) . The project databases 453, 455 and 457 are accessed, the 
data stored therein are retrieved and they are merged to 
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generate an output report 481. The output report 481 includes 
the production server computer parameters, retrieved by the 
automatic inspection routine 407 from the production server 
105a, the list of standard and, if any, additional questions 
5 generated during the execution of the procedures 459 and 4 61, 
and the associated answers obtained, the list of 
recommendations/suggestions and the list of corrective action 
prescriptions, with associated indications of whether the 
corrective actions have been implemented. 

10 The output report 481 includes a human-readable output 

report, for example formatted in HTML, that provides to the 
user all the necessary information on the data processing 
system- 100, and all the indications, prescriptions of 
corrective actions and/or recommendations /suggest ions 

15 necessary for implementing the desired HA data processing 
environment in the data processing system 100. For the 
generation of the human-readable output report, a report 
template is exploited. The report template is for example an 
HTML file, containing pre-defined tags; the project report 

20 generation procedure 479 substitutes every tag in the report 
template with a corresponding project parameter, extracted 
from the project database 315. The human-readable output 
report provides to an HA expert all the information necessary 
for setting up the desired HA data processing environment in 

25 the data processing system 100. In particular, the 
human-readable output report contains detailed information on 
which corrective actions have already been implemented during 
the HA data processing environment definition phase, either by 
the user or automatically, and which corrective actions are 

30 instead still to be implemented. 

In addition to the human-readable output report, a 
machine-readable output report may be generated, for example 
formatted in XML language. The machine-readable output report 
can be exploited by a software tool for automatically 

35 implementing the HA data processing environment. For example, 
on the basis of the output report , the HA implementing 
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software tool can automatically generate resource groups or 
clusters, by means of an automatic "collocation" process (in 
jargon, a process involving establishing binary links between 
the resources in the group) and mirror the generated resource 
5 group clusters into the back-up production server 105a-bk. 

Thanks to the method according to the present invention, 
the process of analysing and tuning an existing data 
processing system with the purpose of setting up an HA data 
processing environment is rendered substantially error-free, 
10 fast and not necessarily subjected to the intervention of a 
highly qualified HA expert. 
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CLAIMS 

1- A method for tuning a data processing system (100) for 
setting up a highly-available data processing environment, 
comprising: 

defining a set of high-availability rules (445); 

defining a set of parameters to be obtained (300, 443, 447) , 
said parameters being indicative of the compliance of the 
data processing system with the high-availability rules; 

obtaining (400-417,459) the set of parameters (453,455), 
said act of obtaining the set of parameters including 
automatically inspecting (407) the data processing system 
for identifying and collecting data processing system 
parameters (303) ; 

automatically evaluating (461) the obtained set of 
parameters by applying the high-availability rules, and 

responsive to said act of evaluating, determining (461,449) 
a set of tuning actions for the setting up of the 
highly-available data processing environment. 

2. The method of claim 1, in which said data processing system 

parameters include parameters identifying resources of the 
data processing system. 

3. The method of claim 2, in which said act of obtaining the 

set of parameters further includes: 
having a user inputting (459) user-defined parameters 
relevant to the setting up of the highly-available 
environment, the user-defined parameters including 
parameters that are not automatically retrievable from the 
data processing system by said act of automatically 
inspecting the data processing system. 

4. The method of claim 3, in which said user-defined 
parameters include parameters defining which of the 
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resources of the data processing system are to be part of 
the highly-available data processing environment. 

5. The method of claim 4, in which said act of automatically 

evaluating the obtained set of parameters includes: 
having the user inputting additional parameters indicative 
of the compliance of the data processing system with the 
high-availability rules . 

6. The method of claim 4 or 5, in which said act of 
determining a set of tuning actions includes: 

providing a set of prescriptions of tuning actions to be 
applied to the data processing system for preparing the 
data processing system to the setting up of the 
highly-available data processing environment. 

7. The method of claim 6, further comprising: 

having the user apply (4135) the tuning actions to the data 
processing system. 

8. The method of claim 6 or 7, further comprising: 
automatically applying (4131) the tuning actions to the data 

processing system. 

9. The method of claim 7 or 8 , further comprising: 
repeating (477,425) said acts of automatically inspecting 

the data processing system for identifying and collecting 
data processing system parameters and automatically 
evaluating the obtained set of parameters after the tuning 
actions are applied to the data processing system. 

10. The method of any one of the preceding claims, further 
comprising: 

generating (479) an output report (481) including the set of 
obtained parameters and a list of tuning actions for the 
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setting up of the highly-available data processing 
environment . 

11. A computer program directly loadable into a working memory 
(115/207,1053/207) of a data processing system (100) for 
actuating the method of any one of claims 1 to 10 when the 
program is executed. 

12. The computer program according to claim 11, comprising 

a server computer program module directly loadable into a 
working memory (105a/207) of a server computer (105a) of 
the data processing system (100), and 

a client computer program module directly loadable into a 
working memory (115/207) of a client computer (115) of the 
data processing system, 

the server computer program module performing the acts of: 

automatically inspecting (407) the data processing system 
for automatically identifying and collecting the data 
processing system parameters (303), and 

transferring (419) to the client computer the collected data 
processing system parameters; 

the client computer program module performing the acts of: 
obtaining (400-417,459) the set of parameters (453,455) 
indicative of the compliance of the data processing system 
with the high-availability rules, said act of obtaining 
the set of parameters including receiving (417) from the 
server computer the collected data processing system 
parameters; 

automatically evaluating (461) the set of parameters by 
applying the high-availability rules; and 
determining the set of tuning actions for the setting up of 
the highly-available data processing environment. 

13. A computer program directly loadable into a working memory 
of a server computer (105a) in a data processing system 
(100), for performing, when executed, the acts of: 
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automatically inspecting (407) the data processing system 
for identifying and collecting data processing system 
parameters (303) indicative " of the compliance of the data 
processing system with predefined high-availability rules; 
and 

transferring (419) the collected data processing system 
parameters to a client computer (115) of the data 
processing system. 

4 • A computer program directly loadable into a working memory 
of a client computer (115) in a data processing system 
(100) , for performing, when executed, the acts of: 

obtaining (400-417,459) a set of parameters (453,455) 
indicative of the compliance of the data processing system 
with predefined high-availability rules, said act of 
obtaining the set of parameters including receiving (417) 
from a server computer (105a) of the data processing 
system server computer data processing system parameters 
(303) automatically collected by the server computer 
(105a) by inspecting the data processing system; 

automatically evaluating (461) the obtained set of 
parameters by applying the predetermined high-availability 
rules to the set of parameters; and 

determining a set of tuning actions for the setting up of 
the highly-available data processing environment. 

5. A computer program product comprising a computer readable 
media on which the computer program of any one of claims 
11 to 14 is stored. 

6. A data processing system (100) including at least one 
computer (105a, 115) comprising; 

a knowledge database (445) including a plurality of 

predetermined high-availability rules; 
means for obtaining a set of parameters indicative of the 

compliance of the data processing system with predefined 
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high-availability rules, said means for obtaining the set 
of parameters including means (300) for automatically 
inspecting the data processing system and for collecting 
data processing system parameters; 

means for automatically evaluating (461) the obtained set 
of parameters by applying the predetermined 
high-availability rules to the obtained set of parameters; 
and 

means for determining a set of tuning actions (473) for the 
setting up of the highly-available data processing 
environment . 
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METHOD FOR TUNING A DATA PROCESSING SYSTEM FOR SETTING UP A 
HIGH -AVAILABILITY DATA PROCESSING ENVIRONMENT 



Abstract 

A method for tuning a data processing system (100) for 
5 setting up a highly-available data processing environment, 
comprising: defining a set of high-availability rules (445); 
defining a set of parameters to be obtained (300,443,447), 
indicative of the compliance of the data processing system 
with the high-availability rules; obtaining (400-417,459) the 

10 set of parameters (453,455), including automatically 
inspecting (407) the data processing system for identifying 
and collecting data processing system parameters (303); 
automatically evaluating (461) the obtained set of parameters 
by applying the high-availability rules and, responsive to the 

15 act of evaluating, determining (461, 449) a set of tuning 
actions for the setting up "of the highly-available data 
processing environment . 
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